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Abstract 

The two principal/immediate influences — which we seek to interrelate here — upon the under- 
taking of this study are papers of Zyczkowski and Slomczyhski (J. Phys. A 34, 6689 [2001]) and 



' of Petz and Sudar (J. Math. Phys. 37, 2262 [1996]). In the former work, a metric (the Monge 

one, specifically) over generalized Husimi distributions was employed to define a distance between 

o , 

two arbitrary density matrices. In the Petz-Sudar work (completing a program of Chentsov), the 



quantum analogue of the (classically unique) Fisher information (montone) metric of a probability 



simplex was extended to define an uncountable infinitude of Riemannian (also monotone) metrics 
on the set of positive definite density matrices. We pose here the questions of what is the spe- 

a, 

ci&c/unique Fisher information metric for the (classically-defined) Husimi distributions and how 



does it relate to the infinitude of (quantum) metrics over the density matrices of Petz and Sudar? 
We find a highly proximate (small relative entropy) relationship between the probability distribu- 
tion (the quantum Jeffreys' prior) that yields quantum universal data compression, and that which 
(following Clarke and Barron) gives its classical counterpart. We also investigate the Fisher infor- 
mation metrics corresponding to the escort Husimi, positive-P and certain Gaussian probability 
distributions, as well as, in some sense, the discrete Wigner pseudoprobability. The comparative 
noninformativity of prior probability distributions — recently studied by Srednicki (Phys. Rev. 
A 71, 052107 [2005]) — formed by normalizing the volume elements of the various information 
metrics, is also discussed in our context. 

PACS numbers: Valid PACS 03.67.-a, 02.50.Tt, 02.50.Sk, 02.40.Ky 
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I. INTRODUCTION 



The two-level quantum systems (TLQS) are describable (nonclassically) in terms of 2 x 
2 density matrices (p) — Hermitian nonnegative definite matrices of trace unity These 
matrices can be parametrized by points in the unit ball (Bloch ball/sphere [l|, p. 10244]) 
in Euclidean 3-space. On the other hand, the TLQS can be described in a classical manner 
using a g ene raliz a«on of the H Usimi Q Q S ec. 4.1] (of. fl fl Q fl Q Q). 

"The Husimi function is a function on phase space, and takes only non-negative values 
while the Wigner function can be negative and is usually violently oscillating. Hence the 
Husimi function can be regarded as a probability distribution in phase space, and its order of 
derealization can be a measure of chaoticity of quantum states" ^(|. (Note that the original 
Husimi distribution was defined only for density operators in separable Hilbert space — one 
which admits a countable orthonormal basis — while the distribution to be studied here is 
defined over a finite-dimensional Hilbert space.) 

There is an (uncountable) infinitude jljj, sec. 16.7] of (quantum monotone) Riemannian 
metrics that can be attached to the Bloch ball of TLQS. Contrastingly, in the classical 
context of the Husimi distribution, there is not an infinitude, but rather a single distinguished 
up to a constant multiple) monotone Riemannian metric — the Fisher information metric 
12) Q]- ("In the classical case, decision theory provides a unique montone metric, 



12, 



namely, the Fisher information. In the quantum case, there are infinitely many monotone 
metrics on the state space" lfil p. 2672].) So, it appears to be an question of obvious interest 
- which we seek to address here — of how one reconciles/deals with this phenomenon of 
classical uniqueness and quantum non-uniqueness, as applied to essentially the same objects 
(that is, the TLQS). 



II. MONOTONE METRICS 

The monotone metrics are all stochastically monotone |15[. That is, geodesic dis- 
tances (as well as relative entropies) between density matrices decrease under coarse- 
grainings (completely positive trace-preserving maps, satisfying the Schwarz inequality: 



T(a*a) 

ity L 



> T(a)*T(a)). These metrics can be used for purposes of statistical distinguishabil- 
. The monotone metrics for the TLQS have been found to be rotationally invariant 
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over the Bloch ball, depending only on the radial coordinate r, that is the distance of the 
state in question from the origin (0, 0, 0) — corresponding to the fully mixed state. They 
are splittable into radial and tangential components of the form ILJ, eq. (3.17)], 

1 . o , .,1-rw- 1 



ds 2 monotone = Y^ dr2 + (J 1 + r ^^Y^) dn2 - W 

Here, using spherical coordinates (r, 61,62), one has dn 2 = r 2 d6\ + r 2 sin 2 6\d6\. Further, 
/ : 1R + — > R + is an operator monotone function such that fit) = tf{t~ l ) for every t > 0. 
(A function is operator monotone if the relation < K < H , meaning that H — K is 
nonnegative definite, implies < f(K) < f(H) for any such matrices K and H of any 
order.) The radial component is independent of the function /, and in the case of the Bures 
(minimal monotone) metric (corresponding to the particular choice fBuresif) = ^r), the 
tangential component is independent of r 16]. 



In the classical context of the Husimi distribution, there is not an infinitude, but rather a 
single distinguished (up to a constant multiple) monotone metric — the Fisher information 
metric Q, ^| . (The counterpart here to stochastic mappings — which are the appro- 
priate morphisms in the category of quantum state spaces — are stochastic matrices |l5|.) 
The ij-entry of the Fisher information matrix (tensor) is the expected value with respect to 
the probability distribution in question of the product of the first derivative of the logarithm 
of the probability with respect to its z-th parameter times the analogous first derivative with 
respect to its j-th parameter. (Under certain regularity conditions, the Fisher information 
matrix is equal to the "second derivative matrix for the informational divergence (relative 
entropy)" pp. 455-456], p. 43].) The volume element of the Fisher information 
metric can be considered — in the framework of Bayesian theory — as a prior distribution 
(Jeffreys' prior [ill [3, 0| ) over, for our purposes here, the Bloch ball of TLQS. 

A. Fisher information metric for the Husimi distribution 

We have found (having to make use of numerical, as well as symbolic MATHEMAT- 
ICA procedures in our quest) that for the Husimi distribution over the TLQS, the Fisher 
information metric takes the specific form (cf. ©), 

ds% sherHus = ~ 2r ~|° 3 gfe) ^ 2 + ((1 + r)/^ s (^)) _1 dn 2 . (2) 
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Too t 

FIG. 1: The monotone function fHus{t) that yields the tangential component of the Fisher in- 
formation metric over the trivariate Husimi probability distributions for the two-level quantum 
systems. 

Here, 

Now, a plot (Fig. H} shows /ff us (t) to be, in fact, a monotone function. (/ff us (t) is "almost" 
equal to J^t-i = ^ ~ !•) It nas a singularity at t — 1, corresponding to the fully mixed 
state (r = 0), where /ff us (l + At) « 3 + 3Ai/2, though we have not attempted to confirm 
its operator monotonicity. Also, fnus{t) fulfills the self-adjointness condition /(£) = t/(£ _1 ) 
of Petz and Sudar p. 2667], at least at t ^ 1. For the pure states, that is t = 0, r — 1, 
we have lim^ /to (*) = 1- 
We further have the relation, 

1 g 2 - p 2 - 2pq log £ 

Cf, "' (p ' ?)= isu!r — — • (4) 

where ch U s{p,q) is a specific "Morozova-Chentsov" function. There exist one-to-one corre- 
spondences between Morozova-Chentsov functions, monotone metrics and operator means 



21 



Cor. 6]. "Operator means are binary operations on positive operators which fulfill the 
main requirements of monotonicity and the transformer inequality" j^ . 
We can write more explicitly as 

-2r-log(i?) 2r + (1 - r 2 ) log(^) „ 

Certainly, ds 2 FisherHus does not have — in terms of the radial component — the specific 



form (0) required of a monotone metric (cf. 22j). In Fig. |21 we show both the radial 
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FIG. 2: The radial components of any monotone metric and that of the Fisher information 
metric derived from the family of trivariate Husimi distributions over the TLQS. The one for the 
(nondenumerably infinite) class ds^ 



Monotone dominates that for ds 2 F 

isheruus ' 



components of (any) ds 2 mcmotone and of dsL,;™.. . Petz 



Fisher Hu3 



Hp- 



nature 



l_ r 2 



934] attributes the unvarying 
of the radial component of the (quantum) monotone metrics to the (classical) 
Chentsov uniqueness (of Fisher information) theorem 12, 3]. "Loosely speaking, the unicity 
[sic] result in the [probability] simplex case survives along the diagonal and the off-diagonal 
provides new possibilities for the definition of a stochastically invariant metric" p. 2664]. 

If we (counterfactually) equate the volume element of ds 2 FisherHus to that of a generic 
monotone metric and solve for f(t), we obtain a mo notonically- decreasing function 

(Fig. ED (cf. I22I), 

\/2(-l 



counter f actual 



(t) 



tr- 



ie) 



t(l + t) ^(-l + t 2 -2t log(t)) 2 (2-2t+ (1 + t) log(t)) 

Converting to cartesian coordinates (x, y, z), the trace of ds Fisher can be simply expressed 
as - log (±=|)/(2.R), where R = ^Jx 2 + y 2 + z 2 (cf. |l4, 24 ]). Also, at the fully mixed state 
(x = y = z = 0), the metric is simply flat, that is 



dH 2 

u;, Fisher Hll 



-{dx 2 + dy 2 + dz 2 ). 





(7) 



(The Riemann and Ricci tensors evaluated at the fully mixed state have no non-zero entries.) 

Numerical evidence indicates that the Fisher information matrix for the Husimi distribu- 
tion over the TLQS is bounded by the corresponding information matrices for the (quantum) 
monotone metrics, in the sense that the monotone metric tensors minus the Fisher-Husimi 
information tensor are positive definite. 
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FIG. 3: Monotonically- decreasing function /counter factual obtained by equating the volume element 
of ds 2 FisherH to that of a generic monotone metric (JJ) 

We can normalize the volume element of ds 2 FisherH to a probability distribution Ph U s 
by dividing by the Fisher information metric volume ~ 1.39350989367660. If we generate 
a "hybridized- Husimi" (quantum jla|) monotone metric, ds 2 HYBHus , via the formula (0), 
using fjius{t), then the volume of the Bloch ball of TLQS in terms of this newly-generated 



monotone metric is \ it 2 (4 
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4.23607 > 1.39351. Using this as a normalization factor, 



we obtain a probability distribution (phyb Hus ) of interest over the TLQS. 



III. COMPARATIVE NONINFORMATIVITIES 



Let us compare ph us — in the manner employed in |25j, |26j (cf. [2j],|28j, sec. VI]) — with 
the prior probability distribution (p Bures)- The latter is gotten by normalizing the volume 
element of the well-studied minimal monotone (Bures) metric ( 29j, eq. (7)] 3fJ, eq. (16)]), 
that is, 

r 2 sin Ox ^ 



PBv 



7T 2 VT 



generated from (JTJ) using the operator monotone function fsuresit) = (We avoid the 
specific designations f m inif) and fmaxif) because these are usually, confusingly, considered 
to generate the maximal and minimal monotone metrics, respectively eq. (3.21)]. Our 
integrations of probability distributions are conducted over r e [0,1], 9% G [0, 7r] and 9 2 € 
[0,2vr].) 

The relative entropy (Kullback-Leibler distance) of p Bures with respect to puus [which we 
denote Skl(v 'Bures ,Phus)] — that is, the expected value with respect to PBures of log PBures - 

PHus 
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is 0.130845 "nats" of information. (We use the natural logarithm, and not 2 as a base, with 
one nat equalling 0.531 bits.) Let us note that the Shannon entropy (Sshannon) of the Husimi 
distribution is the Wehrl entropy (Swehri) of the corresponding quantum state. Explicitly 



implementing 



31 



eq. (6)], we have for the TLQS, 

1 /. . . . . o,. I 



Swehn = ^ (2r + 4r log 2 + (1 + r 2 ) log (j-^-) - 2r log (1 - r 2 )) . (9) 

Swehri is always greater than the von Neumann entropy, S v n = — Trp In p, which for the 
TLQS is expressible as 

S vN = l - (2 log 2 + r log (1^) - log (1 - r 2 )) . (10) 

(We, of course, notice the omnipresence in these last two formulas, as well as in (jSJ) and 
further formulas below of the term W = log (y^)- The two eigenvalues [Ai, A2 = 1 — Ai] of 
p are ^f 1 , so W is expressible as log(^).) Each monotone metric can be obtained in the 
form of a "contrast functional" for a certain convex subset of relative entropies j^l, 3|. 



A. Bures prior 

Now, let us convert psures to a posterior probability distribution (post Bures ) by assuming 
the performance of six measurements, two (with one outcome "up" and the other "down" ) 
in each of the x—, y- and z-directions. Normalizing the product of the prior psures and the 



likelihood function corresponding to the six measurement outcomes |25|, p. 3] 



192p Bures (l-x 2 )(l-y 2 )(l-z 2 ) 

pOSt Bures = ^ , (H) 

we find S KLipost Bures, Phus) = 0.0912313 < 0.130845. (The cartesian coordinates in (JTTJ) 
are transformed to the spherical ones employed in our analysis.) So, in this sense p Bures is 
more noninformative than pn U s, the relative entropy being reduced by adding information 
to p Bures- On the other hand, p Bures — corresponding to the minimal monotone metric - 
is itself the least noninformative of the monotone-metric priors (Pmonotone) 125J- (Luo has 
established an inequality between the [monotone metric] Wigner-Yanase skew information 
and its minimal monotone counterpart 

Reversing the arguments of the relative entropy functional, we obtain SKL(PHus,PBures) = 
.0818197. But now, following the same form of posterior construction, we find 



SKL{postHus,PBures) = 0.290405 > .0818197, further supportive of the conclusion that pBures 
is more noninformative than puus- in some sense, then, psures assumes less about the data 
than Phus- But this diminishability of the relative entropy is limited. If we convert psures 
to a new posterior PostBures using the square of the likelihood function above — that is, 
assuming twelve measurements, four (with two outcomes "up" and the other two "down") 
in each of the x—,y- and z-directions, giving 

PostBm , = 2 1504 PBTO [(l-^)(l-^)(l-^)]"- m = 2 (12) 

then SKL(PostBures,PHus) = 0.292596 > 0.130845. To much the same effect, if we use a 
likelihood based on the optimal /nonseparable set of measurements for two qubits, consisting 
of five possible measurement outcomes, given in 3, eq. (8)], to convert PBures to a new 
posterior, then the relative entropy reaches higher still, that is from 0.130845 to 0.623855. 
(Employing a likelihood based on the optimal/nonseparable set of measurements for three 
qubits, consisting of eight possible measurement outcomes [35J, eq. (9)], the relative entropy 
with respect to Phus increases further to 1.51365.) Actually, if we formally take m = | in 
eq. (|12|1. and renormalize to a new posterior, we obtain a superior reduction, that is, to 
0.07167 < 0.0912313. (Further, with m = §, we get 0.0702389 and 0.0732039, with m = §.) 



B. Morozova-Chentsov prior 



In |25j , it was found that the ( "Morozova-Chentsov" ) prior distribution, 



.00513299[log(g)] 2 sin 6 X 



Pmc = H , (13) 



r 



that is, the normalized volume element of the monotone metric based on the operator 
monotone function, 

- (iTiHiSr (14) 

was apparently the most noninformative of those (normalizable) priors based on the op- 
erator monotone functions that had been explicitly discussed in the literature. Now, 
Skl{pmc,Phus) — 1.37991, that is, quite large. This can be reduced to 0.893996 if, into 
Pmc, one incorporates m = 6 measurements of the type described above; diminished further 
to 0.561901 with m = 12; and further still to 0.471852 — the greatest reduction of this type 
- with m = 18. (For m = 24, it starts to rise to 0.652441.) 
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But, if we again use the likelihood based on the optimal nonseparable measurement of 
two qubits [25J, eq. (8)], with just five measurements, the relative entropy of the corre- 
sponding posterior form of puc with respect to Phus is reduced to 0.342124, which is the 
smallest we have achieved so far along these lines. (For the mentioned optimal nonseparable 
measurement scheme for three qubits, the reduction is quite minor, only to 1.33492 nats.) 
We obtained intermediate-sized reductions to 0.45524 and 0.492979, respectively, by using 
for our measurements, twenty projectors oriented to the vertices [36, sees. 9, 10] of a dodec- 
ahedron and of an icosahedron. (The primary measurement scheme used above, and in [25], 
with six measurements oriented along three orthogonal directions, is tantamount to the use 
of an octahedron.) 

C. Hilbert-Schmidt prior 

The prior distribution generated by normalizing the volume element of the Hilbert- 



Schmidt metric over the Bloch sphere is |25j, eq. (10)] [16J, eq. (31)] 



r 2 sin#i 

Phs = 3 ^^> ( 15 ) 

which is simply the uniform distribution over the unit ball. The Hilbert-Schmidt volume 
element can be reproduced using the formula (JTJ for a quantum monotone metric, making 
use of fns = ^"/^ , but this function is neither monotone-increasing nor decreasing over 

t e [0,1] (cf. 0). 

We have that Skl(ph us ,Phs) — 0.0579239 and Skl(phs,Ph us ) = 0.05443. Now, in 
terms of our usual posterior distributions based on six measurements, SKL(postH US ,PHs) = 
0.0236596 and S Kbipost hs i Phus) = 0.278953, so we can conclude that the Husimi prior phus 
is more noninformative than the Hilbert-Schmidt prior phs- 



IV. UNIVERSAL DATA COMPRESSION 

Employing phus as a prior distribution (Jeffreys' prior) over the family (Riemannian man- 
ifold) of Husimi qubit probability distributions, the (classical) asymptotic minimax/maximin 



redundancy of universal data compression is equal to (18J, eq. (2.4)] [17] . 



3 ft 3 Th 3 

- log — + log 1.39350989367660 = - log — + 0.331826 = - log n - 3.92499, (16) 
2 lire 2 Zne 2 
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where n is the sample size (the number of qubits [TLQS]) and we used the before-mentioned 
volume of ds 2 FisherH . ("Suppose that X is a discrete random variable whose distribution 
is in the parametric family {Pq : 6 G 0} and we want to encode a block of data for 
transmission. It is known that a lower bound on the expected codeword length is the 
entropy of the distribution. Moreover, this entropy bound can be achieved, within one bit, 
when the distribution is known. Universal codes have expected length near the entropy no 
matter which member of the parametric family is true. The redundancy of a code is defined 
to be the difference between its expected len gth and its entropy" \v\ . p. 459].) 

For the quantum/ nonclassicaA counterpart [38^ (cf. 0, Q, Q] ) > let us consider the use of 



the "Grosse-Krattenthaler-Slater" ( "quasi-Bures" ) probability distribution [35J, eq. (33)], 



0.0832258e/l 



Pgks = \_ r2 (y^) T Sin ^' (17) 

This is the normalized form of the monotone metric (JT]) associated with the (presumably 
operator) monotone function, 

+*/(*-!) 

fGKsit) = — ^— . (18) 

(Taking limits, we have for the fully mixed state, Jgks^X) = 1 and for the pure states, 
/Gifs(O) = e _1 .) It appears 42] (though not yet fully rigorously established) that the 
(quantum) asymptotic minimax/maximin redundancy, employing pgks as a prior probabil- 
ity distribution over the 2x2 density matrices (and their n-fold tensor products (cf. 43])), is 
|logn — 1.77062. This is greater than the classical (Husimi-Fisher-information-based) ana- 
log (fTB*j) by 2.20095 nats of information. It would seem that this difference is attributable to 
the greater dimensionality (2 n ) of an n-qubit Hilbert space, as opposed to a dimensionality 
of 3n for n trivariate Husimi probability distributions over the TLQS. 

We further note that S K L(PBures,PHYB Hus ) = 0.00636046 and S K L(PHYB Hu3 ,PBures) = 
0.0062714, both being very small. Smaller still, SKL(PBures,PGKs) — 0.00359093 and 
SKL(PGKS,PBures) = 0.00354579 - - whence the designation p qua si-Bures = Pgks- But 
then, even more strikingly, we computed that Skl(pgks,Phyb Hus ) = 0.000397852 and 
Skl(phyb Hus ,Pgks) = 0.000396915. In Fig. HJwe show the one-dimensional marginal prob- 
ability distributions over the radial coordinate r of the five distributions PBures, Phyb Hus , 
Phus, Pgks and Pmc, with those for phyb Hus and Pgks being — as indicated — particularly 
proximate. 
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marg. prob. 




FIG. 4: Plots of one-dimensional marginal probability distributions over the radial coordinate r of 
PBures, Phyb Hus , Pgks, PHus and pmc- The order of dominance of the curves is: p Hus > PBures > 
PGKS > Phyb Hus > Pmc- The marginal distributions of Phyb Hus and pgks are quite close, as 
reflected in their small relative entropy (~ .0004). 

Substitution of Phyb Hus for Pgks into the quantum asymptotic (maximin) redundancy 



42, eq. 



formula that has to be maximized over all possible prior probability distributions 
(4-3)], 

-logn- - - -log2- — log 7r 

+ 4n [ ( -log(l - r 2 ) + — log f - — - J— log w(r)] r 2 w(r) dr, (19) 
Jo V 2r \l + rj J 

leads to a very slightly decreased (and hence suboptimal) redundancy, §logn — 1.77101 vs. 
| logn — 1.77062. (Use of PBures as a quantum prior over the 2x2 density matrices gives us 
a constant term of —1.77421, use of phus, —1.88279 and use of Pmc, —2.15667.) To obtain 
the appropriate form of w(r) to use in (fTUj) . we take our probability distributions (such as 
(JHJ) and (|13|)b divide them by 4ixr 2 and integrate the results over 9\ € [0, tt] and 62 £ [0, 2tt}. 
(Thus, we must have 4tt J w(r)r 2 dr = 1.) The minimax objective function is 

/3 13 3 1/1— r\ \ 

min max ( - logn log 2 log7r — logfl — r 2 ) H log ( I — logiyfr) ) . (20) 

w Q<r<i\2 6 2 2 6 2 6 6V ; 2r 6 V 1 + r / J 

The minimax is also achieved using the w(r) formed from pgks- 

We can, additionally, achieve an extremely good fit to puus by proceeding in some- 
what an opposite fashion to that above - - reversing our hybridization procedure. Em- 
ploying fcKsif), rather than fuusif) in the expression © for ds 2 Fisher and obtaining 
the corresponding normalized (dividing by 4.00277) volume element (Phybgks^ we ^ n< ^ 
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Skl{Phyb gks iPhus) = 0.000316927. (Interchanging the arguments of the relative entropy 
functional, we get 0.000317754.) It is quite surprising, then, that a joint plot of fGKs(t) 
and fHus{t) readily shows them to be substantially different in character (for example, 
/if us (50) = 55.8161 and /gks(50) = 19.9227), since they have been shown here to generate 
two pairs of such highly similar probability distributions, one pair composed of (quantum) 
monotone (pgks and Phyb Hu3 ), and the other pair of (quantum) non-monotone metrics 
(Phyb gks and p Hus ). 



V. ESCORT-HUSIMI DISTRIBUTIONS 



For the escori-Husimi distributions |44j, we raise the probability element of the Husimi 
distribution to the g-th power, and renormalize to a new probability distribution. (Of course, 
the Husimi distribution itself corresponds to q = 1. If we set a = 2q — 1, we recover the 

.) To normalize the q-th power of the Husimi distribution, one 



45 



a-family of Amari |33 , 

must divide by 

2 -i I - (i_ r )i+9 + (i + r )i+9 



r + qr 

A. The case q = 2 

For (entropic index) q = 2, the Fisher information metric takes the form 

12 . , 4 . ,1-r. 



We have f g=2 {l) = § and / g=2 (0) = \. 



(21) 



where 



1. Relative entropies 

Further, the relative entropies S KL (p Hus ,p EsCq=2 ) = 0.0114308 and S KL (p Bures ,p EsCq=2 ) = 
0.42964, So, it appears that pE S c q=2 ^ s even l ess noninformative than pn U s (recalling that 
SKL(PBures,PHus) = 0.130845 < 0.42964), which in turn we found above was less noninfor- 
mative than the prior probabilities formed from any of the (quantum) monotone metrics. We 

12 



also note that SKL{post Buresi pEsc q=2 ) = 0.125159 < 0.42964. If we "hybridize" dsp iaher 
by modifying its radial component into that required of a (quantum) monotone metric, then 
we find that S KL {p Bures ,p HYBq=2 ) = 0.00246031(< (S KL (p Bures ,p HY B Hus ) = 0.00636046) is 
quite small. 



B. The cases q > 2 



For the escort-Husimi probability distribution with q = 3, the Fisher information metric 
takes the form 

ds lisher^ = JY^-2 dr2 + (C 1 + 0/ 9 =3(^))^ 2 , (24) 

where 

= 0ry < 25 > 

Now, / ?=3 (1) = / g=3 (0) = | and a plot of f q=3 (t) clearly manifests monotonic behavior also. 
(The monotonically-decreasing scalar curvature of ds Fisher<] _ 3 equals | at r = 0.) We have 
that S KL (p Bures ,p EsCq=3 ) = 0.63705 > S KL (p Bures ,p EsCq=2 ) = 0.42964, so the informativity 
(noninformativity) of the escort-Husimi prior probabilities appears to increase (decrease) 
with q. 

For q — 4, 

, 2 80(5 - 2r 2 + r 4 ) , 2 . , ,1-t\\-i 2 

dS = 3(5 + 10^ + ^ + (J 1 + r )^(lT7)) dn ' ^ 

where 

f m 3(t 4 + t 3 + t 2 + t + i) 

/Hl J " 4(t+l)(3t 2 + 4t + 3)' 1 ' 



For g = 5 

' /s ' r "^~ (3 + 10r 2 + 3r 4 ) 2 V v ^ ' ' yj<?=ov l + r 



, 2 3(5-r 2 )(5 + 3r 4 ) , 2 w 4-r,\-i 2 

ds Fishera=5 = ; o , 1f / 2 , -TTj V + (1 + r)/ 9=5 (— - ) rfn 2 , (28) 



where 

f (t) 2 ^ 4 + ^ 2 + 1 ) f29) 

/,4 J ' 5(t+l)(2t 2 + t + 2)' 1 j 

We have (as found by C. Krattenthaler, making use of explicit MATHEMATICA computa- 
tions of ours for q = 2, 3, • • • , 40) (cf. 0, sec. 3.2] H), 

(g - ijgLog 



m = 7, r^Ji.r ^ (so) 
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FIG. 5: The monotone functions f q= i(t),i = 1, • • • ,30 that yield the tangential components of the 
Fisher information metric over the escort-Husimi (q = i) probability distributions. The steepness 
of the graphs decreases as q increases 

(For odd q some simplification in the resulting expression occurs due to cancellation by a 
factor of (t + 1).) 

In Fig. we plot f q= i(t),i = 1, • ■ ■ ,30, revealing their common monotonically- increasing 
behavior. (Of course, we have f q =i(t) = fnus(t), shown already in Fig. [TJ The steepness of 
the curves decreases with increasing q.) 

Let us further note that in addition to SKL(PBures,PHYB Hus ) = 0.00636046 

and S KL (p B ures,PHYB q=2 ) = 0.00246043, We have S K L(pBures,PHYB q = 3 ) = 

0.0132258, S K L(pBures,PHYB q=i ) = 0.0238858 and S KL {p Bures ,p HY B q=5 ) = 0.0327578. 
(We have also been able to compute that SKL(PBures,PHYB q = 1000 ) = 0.0969315 and 
Skl(pgks, Phy b 9= iooo) = 0.127027.) So, the best of these fits of psures to the prior 
probabilities for the hybridized-escort-Husimi probability distributions is for q = 2. 

C. Tangential components 

Now, we can reexpress the formula (J3TJ)) without summations, making use of the binomial 
theorem, as 



hit) 



(-l+q) (-1+tf (-l + t 1+ <>) 



(31) 



q (1 + t) (l-q + t + qt-ti - qtfl - t l+ i + qt 1+ i) ' 
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So, we could study hybridized escort-Husimi metrics based on non-integral q using this 
formula. (We note that (|3~T|). in fact, yields lim^i f q (t) = fnusit).) For example, 

/ 9= i(t) = 6 + 6v / t + 2t- r ^. (32) 

Thus, gives us (following the formulation (JTj)) the tangential components of the escort- 
Husimi Fisher information metrics for arbitrary q. (Pennini and Plastino j^J have argued, 
though, that in a quantal regime, q can be no less than 1. Tsallis statistics with an en- 
tropic index of q = §, Beck has contended, correctly describes the small-scale statistics of 
Lagrangian turbulence |4 



D. Radial components 

We do not have, at this point, a comparable complete formula for the radial components. 
However, C. Krattenthaler has shown — making use of explicit computations of ours for the 
cases q = 2,3, ■ ■ ■ ,18 — that the denominators of the functions giving the radial components 
are simply proportional to 

( a Pochhammerlq — 2i + l,2i + l]r 2i \ 2 

u{q) = ( EU 2{2lTT)\ J ■ (33) 

(The Pochhammer symbol is synonymous with the rising or ascending factorial. The 
obtaining of comparable formulas for the numerators of the radial components might 
be possible using the "Rate.m" program available from the website of Krattenthaler 
http:/ /www.mat.univie.ac .at^kratt/| , if we had available additional explicit computations 
beyond the q = 18.) As way of illustration, the radial component of dsp isher ^ 8 is expressible 
as 

144 (21 + 42 r 2 + 135 r 4 + 28 r 6 + 35 r 8 - 6 r 10 + r 12 ) 

7u(8) ' 

VI. POSITIVE P-REPRESENTATION FOR TLQS 



(34) 



Braunstein, Caves and Milburn focused on a specific choice of positive P-representation 
which they called the canonical form and which is always well defined pui eq. (3.3)] (cf. 
sec. 6.4]): 

P can {a,f3*) = ^exp(-i|a-/3| 2 ) < I («+/?) |p|i( a+ /3) >= exp (~\ a - p\ 2 )Q(^(a+p)). 

(35) 
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"The canonical form is clearly positive, and. ..it is essentially the Q-function [Husimi distri- 
bution]" 0- 

We sought to implement this model, choosing for a and (3 independent 2-dimensional rep- 
resentations of the spin-| coherent states (while for the Husimi distribution or Q-function, 
only, say a, need be employed). (The "positive P representation achieves [its] consider- 
able success by doubling the number of degrees of freedom of the system, i. e., doubling 
the number of dimensions of the phase space" [sol - p. 1153]. More typically, in the posi- 
tive P-representation, a and (3 are allowed to vary independently over the entire complex 
plane.) However, then our result — using this choice of a and (3 — was not normalized to 
a probability distribution in the manner indicated in (J35j) . 

We noted that Braunstein, Caves and Milburn had commented that a "positive P repre- 
sentation can be defined for a large class of operators. We restrict ourselves here to those that 
are built up from the standard annihilation and creation operators of a harmonic oscillator. 
In particular, our work does not apply to generalizations of the positive P representation 
that include spin or pseudospin operators often used to describe a two-level atom" 0, p. 
1155]. (We are not aware, however, of any specific applications reported in the literature of 
the positive P-representation to n- level [finite-dimensional] quantum systems.) 

We did not perceive how to exactly (re)normalize the distribution (J35j) for our particular 
choices of a and (3. So, we expanded just the exponential term of into a power series 
in third order in the four phase variables and exactly normalized the product of this series 
with the remaining unmodified factor (the Q-function or Husimi distribution) to obtain a 
new (presumed) probability distribution. We then fit (numerically) the resultant tangen- 
tial component of the associated Fisher information metric to the form (Q) required of a 
monotone metric. In Fig. |U] we show what we (gratifyingly) obtained in this manner for 
fp(t). In Fig. [7|we show an approximation to the radial component of ds Fisherp , similarly 
obtained. (The positive P-function "seems to possess some interesting properties and may 
deserve close inspection" 3, P- 175].) It would be of interest to see how near the associated 
probability distributions (pp and Phyb p ) would be to the probability distributions (already 
discussed above) Pgks^Vhus^Vhyb Hu!> and P HYBgks - Most pressing, though, is the question 
of whether or not the concept of a positive P-representation does, in fact, have a meaningful 
and natural theoretical application to the n-level quantum systems. 
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FIG. 6: Approximation to the presumed operator monotone function fp(t) yielding the tangential 
component of ds 2 Fisherp for the positive P-representation over the two-level quantum systems 

rad. comp . 




FIG. 7: Approximation to the radial component of ds 2 Fisherp for the positive P-representation over 
the two-level quantum systems 

VII. GAUSSIAN DISTRIBUTION 



An approach quite distinct from that of the Husimi probability distributions, but still 
classical in nature, to modeling quantum systems has been presented in DDODD (cf. 

Here the family of probability distributions is taken as that of the Gaussian (complex 
multivariate normal distributions') having covariance matrix equal to the density matrix. For 
the TLQS, Slater 3, eq. (13)] 54, eq. (16)] derived the corresponding Fisher information 
metric. This is representable as, 



ds 2 

Fisher Gaus 



2(l + r 2 ) j2 2 , 2 
-ar + -an . 



(36) 



(1 — r 2 ) 2 I — 

The tangential component can be reproduced, following the basic formula (0), by choosing 
fcaussit) = j!-. This is simply one-half of that — frhit) = 2f Gauss {t) = ^ — associated 
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FIG. 8: Radial components of ds 2 monotone and ds|v„f,™._ . The latter dominates the former. 



Fisher Gaus 



with the maximal monotone (Yuen-Lax) metric [55(. Like that metric, the metric (J36)) yields 
a non-normalizable volume element (so one can not immediately apply — without some 
preliminary truncation — the comparative noninformativity/relative entropy test we have 



used above j3, l26l|). Of course, the radial component of is also not consistent with 
the requirement for a monotone metric. In fact, it rises much more steeply than jz^z, in 
opposite behavior to that for ds 2 FisherHus . In Fig. IHlwe show this phenomenon. 



VIII. DISCRETE WIGNER FUNCTION FOR A QUBIT 



The discrete Wigner function (pseudoprobability) W, in the simplest case of a qubit, is 
defined on a 2 x 2 array, with four components Wij,i,j = 1,2 j^y, eqs. (14)-(17)]. The 
sum of Wij in each "line" A is the probability pij of projecting the state onto the basis 
vector |cKy >, where i G {1,2,3} indexes a set of three mutually unbiased bases (MUB) 
for a qubit and j G {1, 2} indexes the basis vector in each MUB. Choosing the MUB to be 
the eigenstates of the three Pauli operators, and using our cartesian coordinates, one can 
obtain three one-dimensional marginal (binomial) probability distributions over the x—,y— 
and z-axes, of the form (^jr, ^f 2 ), • • • (cf. 0, S|). Now, the corresponding Jeffreys' prior 
for the one-dimensional family of such binomial distribution is simply the beta distribution 
Pp{x) = ;~7j=5 ■ (Let us note that the one- dimensional marginal distributions obtained for 
PBures are of another form, that is, 2 ^ 1 ~ x2 .) 

Let us take the product of pp(x),Pp(y) and pp(z), which naturally forms a (prior) prob- 
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FIG. 9: Scalar curvature of the Fisher information metric for the family of Husimi distributions 



ability distribution, 



Pproduct 



(37) 



7rV(i-z 2 )(i-y 2 )(i-z 2 )' 

over the hypercube with vertices (±1,±1,±1) and renormalize/truncate it to a probability 
distribution over the Bloch sphere, 



1 



PWigner 



(38) 



6.61455516101^(1 - x 2 )(l - y 2 ){l - z 2 ) 

(Thus, the quantum-mechanically inaccessible region lying outside the Bloch ball, but within 
the hypercube is disregarded — assigned null measure — in the new normalization.) 

Now, we found — strictly following the notation, formulas and line of argument above 
in sec. |ITT] — that S K L{pwigner,PHus) = 0.0149831 and S K l{phus,Pw 



igner ) 



0.0156225, so 



these two distributions are rather close in nature. Of course, phus is rotationally-symmetric 
over the Bloch sphere, while pw igner is not, so it seems to make little sense to try to compute 
some function fwigner{t) to generate the tangential component. We found it problematical, 
using our usual (relative entropy) approach, to designate either Ph us or pwigner as more or 
ess noninformative. (The "Husimi function is a kind of... coarse- grained Wigner function" 



4a p. 3].) 



IX. SCALAR CURVATURE 



In Fig. El we plot the scalar curvature of ds 2 Fisher . The formula for this scalar curvature 
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IS 

R n=2 _ r (-Qr + W (-3 + r 2 )) (-4r 2 (-3 + r 2 ) + QWr (2 - 3 r 2 + r 4 ) + W 2 (3-8r 2 + 5r 4 )) 

Hus ~ (W + 2rf (-1 + r 2 ) (-2r + W (-1 + r 2 )) 2 ' 

(39) 

where = log Also, expanding about r = 0, 

„_ 9 -6r 2 138 r 4 32094 r 6 154474 r 8 57710054 r 10 . , 

K^j~i « . (40) 

5 125 30625 153125 58953125 v y 

The nonpositive monotonically- decreasing scalar curvature (Fig.[0J) has its maximum at r = 0, 

corresponding to the fully mixed state, indicative of a flat metric there (cf. (JJJ)) (and is — oo 

at the pure states, r = 1). For the minimal monotone (Bures) metric, the nonnegative scalar 

curvature is constant, that is K^~ 2 = 6, over the Bloch ball, and for the (n 2 — l)-dimensional 

convex set of n x n density matrices, n > 2, achieves its minimum of K^ in = — ~ 4 g < ~" ~^ 

at the fully mixed state (p = -I) j^]. (In j^], the metric used is one-quarter of that 

corresponding to (0), used here, so the results we compute here differ from those there by 

such a factor. For the maximal monotone metric, ^ = ^y— which is monotonically- 

decreasing as r increases, as is K^j~ 2 .) 

For the two-level quantum sytems, Andai j^] has constructed a family of monotone 

metrics with non-monotone scalar curvature, and given a condition for a monotone metric 

to have a local minimum at the maximally mixed state. 

A. Metrics of constant scalar curvature 

The metric ds 2 Fisherq _ 2 has cons tant scalar curvature, K™= 2 = §> while, as previously noted, 
K^~ 2 = 6. Let us note that K^ Y = |( n2 ~ 1)(«- 2 — 2), which is also | for n = 2. Here, 
WY denotes the Wigner-Yanase metric — the only pull-back metric among the quantum 
monotone metrics — and fwyit) = ^(y^+l) 2 , which is the only self-dual operator monotone 



function j6l|. "It is not known at the moment if there are other monotone metrics of constant 
sectional and scalar curvature" p. 3760]. It is a theorem that the "set of two-dimensional 
normalized density matrices equipped with the Bures metric is isometric to one closed-half 



of the three-sphere with radius 



2 



The WY-metric "looks locally like a sphere of radius 



2 of dimension (n 2 — 1)" fol P- 3759]. If we transform to spherical coordinates on the 
3-sphere, then, the metric tensor for ds 2 min is diag onal in character, while the two other 
(constant scalar curvature) metrics are not (cf. 63j| ) . 
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The three metrics ds 2 miw ds^y an d ds 2 Fisher _ 2 are Einstein. If we scale these metrics 
so that they are all of unit volume Q, then K^ /scaled = 6vr 2 « 59.2176, tf^ /MBted = 
67r(7r-2) « 21.5185 and K^ /scaled = 4n 2 -6y/3ir w 6.83003. The constant scalar curvatures 
of (unit-volume) Yamabe metrics are bounded above, and their least upper bound is a real 
number equal to n(n — l)Vn^ r \ where V n is the volume of the standard metric on S n , and 
in our (Bloch sphere) case, n = 3, so the bound is 242^ttI « 139.13 Q. 



X. DISCUSSION 

Luo 



24| (cf. |44j, sec. 2.4] 



20j,|65j) has calculated the Fisher information matrix of the 
Husimi distribution in the Fock-Bargmann representation of the quantum harmonic oscilla- 
tor with one degree of freedom. He found that the Fisher information of the position and 
that of the momentum move in opposite directions, and that a weighted trace of the Fisher 
information matrix is a constant independent of the wave function, and thus has an upper 
bound. (Luo did not consider the possibility of generating prior probability distributions by 
normalizing the volume element of the Fisher information metric.) 

Gnutzmann and Zyczkowski noted that one "is tempted to think of the Husimi function 
as a probability density on the phase space. However, the rules for calculating expectation 
values of some observable using the Husimi function are non-classical" 47, sec. 2.11 (cf. 

p. 548]). Gardiner and Zoller remarked that the "main problem of the Q-function is 
that not all positive normalizable Q-functions correspond to positive normalizable density 



operators" 



51 



p. 109]. 

Further, the comparison of distances between Husimi distributions for arbitrary quantum 
states based on the Fisher information metric with those employing the Monge distance jjj , 
might be investigated. For the TLQS studied here, the Monge distance is, in fact, "consistent 
with the geometry of the Bloch ball induced by the Hilbert-Schmidt or the trace distance" 0, 
p. 6716]. (The trace distance is monotone, but not Riemannian, while the Hilbert-Schmidt 

i ' n n 

distance, contrastingly, is Riemannian, but not monotone |67j, p. 10083] |37j.) For n- 
dimensional quantum systems (n > 2), unlike the trace, Hilbert-Schmidt or Bures distance, 
the Monge distance of p to the fully mixed state — which provides information concerning 
the localization of p in the classical phase space — is not the same for all pure states j3j. 
The only monotone metrics for which explicit distance formulas are so-far available are the 
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FIG. 10: Statistical distance as a function of distance from the origin of the Bloch ball — corre- 
sponding to the fully mixed state — for any monotone metric, for ds Fisher , and for the Monge 
(or equivalently, for n = 2, Hilbert-Schmidt) metric. The monotone-metric curve dominates that 
for ds 2 FisherH , which dominates the linear curve for the Monge metric. 



Bures (minimal monotone) and Wigner-Yanase ones 

In Fig. El we show how the distance from the fully mixed state (r = 0) increases as 
r increases, for any monotone metric and for ds 2 FisherH , and (linearly) for the Monge (or 
Hilbert-Schmidt) metric. The first-mentioned distance — taking the functional form arcsinr 
(equalling ~ for r = 1) — dominates the second-mentioned distance (equalling 4 5551532167057 



for r = 1), which in turns dominates the third [3|, eq. (4.10)], which takes the value | for 
r = 1. 

Let us bring to the attention of the reader, a recent preprint, which introduces a concept of 
escort density operators and a related one of generalized Fisher information 68] (cf. 0,|k|). 

We have been consistently able above to find (apparently operator) monotone func- 
tions to generate the tangential components of (classical) Fisher information metrics for 
(rotationally-symmetric) probability distributions over the TLQS. We suspect the existence 
of some (yet not formally demonstrated) theorem to this effect. Also, it would be of interest 
to formally test the various monotone 



for a quantum monotone metric 
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'unctions presented above for the property (requisite 
21 1 of operator monotonicity. 
We have "hybridized" ds Fisher above to a (quantum) monotone metric ds 2 HY B Hus by 
replacing its radial component by that required (•jztt)) while retaining its tangential com- 
ponent (formed from fuusif))- But it appears that we could also convert it by appropriately 
scaling (a conformal transformation) the entire metric (tangential and radial components) 
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by some suitable function. If we do so, we find that — by explicit construction — the new 
metric {.ds 2 con f ormalHus ) has the required radial component, while the tangential component 
is generated by a function 

f con formal Hus^) fllusij^) t 1; (^-Q 

which also appears to be operator monotone. (We note that fconformai Hus (l) — 1 
and \im t ^o f con formal Hua (t) = 0.) But now, we have the large relative entropies 

V con f ormal fj ua 

) = 50.4636 and SKL{p C onformai Hus ,PGKs) = 54.2601. At r = 0, 
ds2 confor m ai Hus is not flat > as is ds Fisher Hus , but has a (limiting) scalar curvature of -f . 



A. Further questions 

Motivated by the analyses above, we would like to pose the question of whether there 
exists a family of trivariate probability distributions parametrized by the points of the Bloch 
ball, for which the associated (classically unique [up to a constant multiple]) Fisher informa- 
tion metric fully — both in terms of tangential and radial components - has the requisite form 

for a monotone metric. Also, the volume elements (and hence associated prior probabil- 
ities) of the monotone metrics are expressible as the product of Haar measure and measures 
over the eigenvalues Q|. To what extent, if any, does this hold true for prior probabilities 
not arising from monotone metrics? Are there any non-monotone metrics which give rise 
to prior probabilities more noninformative than (at the very least) the minimal monotone 
(Bures) one? What are suitable counterparts to formula for n-level quantum systems 
(n > 2)? Are there any monotone metrics which are flat at the fully mixed state, as is 
ds 2 FisherHus ©? 



XI. SUMMARY 



In a classical context, for the family of Husimi probability distributions over the three- 
dimensional Bloch ball of two-level quantum systems (TLQS), we derived the (flat-at-the- 
fully-mixed-state) Fisher information metric {ds 2 Fisher , given by ©). Its tangential — but 
not its radial (r) — component conformed to that of one of the (uncountably) infinite class 
of (quantum) monotone metrics. The prior probability distribution (pn U s) formed by nor- 
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malizing the volume element of ds Fish was found (sec. IIII A|) to be considerably less non- 
informative than the priors formed from any of the (quantum) monotone metrics, even that 
(pBures) based on the (relatively informative) minimal monotone (Bures) metric. However, 
if we replaced the radial component of ds Fish by that required {jz^s) of all (quantum) 
monotone metrics, the resultant "hybridized- Husimi" prior probability (phyb Hus ) became 
very close (in the sense of relative entropy ~ .006 "nats") to psures, and thus comparably in- 
formative in nature, but even nearer (^ .0004) to another quantum-monotone-metric-based 
("Grosse-Krattenthaler- Slater" or "quasi-Bures" ) probability distribution (pgks) that has 
been conjectured to yield the asymptotic minimax/maximin redundancy for universal quan- 
tum coding. The analogous (Bayesian) role in universal (classical) coding — - by a well-known 
result of Clarke and Barron — is played by Jeffreys' prior (cf. [19, 20]). This takes the 



specific (original, non-hybridized) form Phus for the family (manifold) of trivariate Husimi 
qubit probability distributions under study. We also studied the Fisher information metric 
for the escori-Husimi (sec. IVjl . positive-P (sec.lVIjl and certain Gaussian probability distribu- 
tions (sec lVHJ) . as well as, in some sense, the discrete Wigner pseudoprobability (sec. IVIII^ . 
Additionally, we applied the Clarke comparative noninformativity test j3, |3| to quantum 
priors (sec. IIII)) . Evidence that this test is consistent with the recently-stated criterion of 
"biasedness to pure states" of Srednicki (^tJ has been presented j3| . 
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